Abstract

The study of community networks has attracted considerable attention recently. 
In this paper, we propose an evolving community network model based on local
processes: the intra-community addition of new nodes and the intra- or
inter-community addition of new links. Employing growth and preferential
attachment mechanisms, we generate networks with a generalized power-law
distribution of node degrees.
1 Introduction



Complex networks, which evolved from the Erdős-Rényi random graph [1], are
powerful models for describing many complex systems in biology, sociology, and
technology [2]. In the past decade, the surge of interest in the structure and
evolution of real-world networks has centered on two striking characteristics.
One is the small-world property [3], which means that a network has a high
degree of clustering, like regular networks, and a small average distance
between any two nodes, like random networks. The small-world phenomenon has
been successfully described by network models with some degree of randomness
[3,4]. The other is scale-free behavior [5], which means a power-law
distribution of connectivity, $P(k) \sim k^{-\gamma}$, where $P(k)$ is the
probability that a node in the network has $k$ connections to other nodes and
$\gamma$ is a positive real number determined by the given network. The origin
of scale-free behavior has been traced back to two mechanisms that are observed
in many systems: growth and preferential attachment [5,6].



Preprint submitted to Elsevier 



4 February 2009 



Recently, as network research has progressed, many other statistical
characteristics of networks have come to the fore. Particularly prominent is
the so-called "community" (or "modularity") structure: a network is composed of
many clusters of nodes, where the nodes in the same cluster are highly
connected, while there are few links between nodes belonging to different
clusters. For instance, groups are formed in scientific collaboration networks
[7]. It has also been found that dynamical processes on networks are affected
by community structure: for example, tendencies spread well within communities
[8], whereas diffusion between different communities is slow [9].

In the study of community networks, research has mostly followed two distinct
directions. On the one hand, attention has been paid to designing algorithms
for detecting community structures in real networks. A pioneering method was
proposed by Girvan and Newman [7], who introduced a quantitative measure for
the quality of a partition of a network into communities. Later, a number of
algorithms were proposed to optimize this measure well at the least
computational cost. The fastest available procedures use greedy techniques [10]
and extremal optimization [11], which are capable of detecting communities in
large networks. On the other hand, research has focused on the modeling of
networks with community structures. In Ref. [12], a static social network was
introduced in which individuals belong to groups that in turn belong to groups
of groups, and so on. In Ref. [13], a networked seceder model was suggested to
illustrate group formation in social networks. In Ref. [14], a growing
bipartite network for social communities with group structures was proposed.
Each of those models is constructed based on one aspect of reality.

In this paper, we introduce a network model with communities that gives a
realistic description of local events [15,16,17]. The model incorporates three
processes: the intra-community addition of new nodes and the intra- or
inter-community addition of new links. Using growth and preferential attachment
mechanisms, we generate a community network with a right-skewed distribution of
node degrees, which has been observed in many social systems.



2 Model 



The Barabási-Albert network [5] describes only one particular type of evolving
network: the addition of new nodes that connect preferentially to the nodes
already present in the network. Systems in the real world, however, are much
richer. For example, in scientific collaboration networks, a multidisciplinary
scientist not only collaborates with scientists in his own research field but
also has a strong desire to collaborate with scientists in other fields. In
friendship networks, a person usually makes friends with people belonging to
different communities besides the community he belongs to. To give a realistic
description of this kind of network construction, we introduce a growing model
of community networks based on local events: the intra-community addition of
new nodes and the intra- or inter-community addition of new links. The proposed
model is defined as follows.

We start with M (> 2) isolated communities and each community consists of 
a small number n of isolated nodes. At each time step, we perform one of the 
following three operations. 

(i) With probability p we add a new node to a randomly chosen community.
Here "randomly chosen" means that the community is selected according to
the uniform distribution; we denote the selected community as the u-th
community. The new node is connected to only one node already present in the
selected community. The probability that node i in community u is selected
depends on its intra-community degree,

$$\Pi(k^{\mathrm{intra}}_{u,i}) = \frac{k^{\mathrm{intra}}_{u,i} + 1}{\sum_j (k^{\mathrm{intra}}_{u,j} + 1)}, \qquad (1)$$

where the sum runs over nodes in community u and $k^{\mathrm{intra}}_{u,i}$ is
the intra-community degree of node i in community u.

(ii) With probability q we add a new link in a randomly chosen community. 
For this we randomly select a node in a randomly chosen community u as the 
starting point of the new link. The other end of the link is selected in the same 
community with the probability given by Eq. (1). 

(iii) With probability r (= 1 - p - q) we add a new link between two
communities. For this we randomly select a node in a randomly chosen community
u as the starting point of the new link. The other end of the link, node i in
another community v, is selected with a probability that depends on its
inter-community degree,

$$\Pi(k^{\mathrm{inter}}_{v,i}) = \frac{k^{\mathrm{inter}}_{v,i} + 1}{\sum_{m \neq u,\, j} (k^{\mathrm{inter}}_{m,j} + 1)}, \qquad (2)$$

where the sum runs over nodes in all communities except for community u
and $k^{\mathrm{inter}}_{v,i}$ is the inter-community degree of node i in
community v.

After t time steps, this scheme generates a network of Mn + pt nodes (on
average) and t links. The parameters p, q, and r control the network structure.
For small r, the generated network has a strong community structure. Notice
that, whichever process is chosen during the network growth, only one link is
added to the system at each time step (duplicate and self-connected edges are
forbidden); this restriction, however, is not essential. We choose the link
probabilities $\Pi(k^{\mathrm{intra}})$ and $\Pi(k^{\mathrm{inter}})$ to be
proportional to $k^{\mathrm{intra}} + 1$ and $k^{\mathrm{inter}} + 1$,
respectively, so that isolated nodes have a nonzero probability of acquiring
new links.
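The three operations can be sketched in code. The following is a minimal illustration rather than the simulation used for the figures: the function name `grow_community_network`, its return values, and the choice to simply skip a time step whose proposed link would be a duplicate or a self-loop are assumptions, since the text only states that such edges are forbidden.

```python
import random


def grow_community_network(M=10, n=5, p=0.4, q=0.4, steps=2000, seed=0):
    """Grow a network by operations (i)-(iii); names here are illustrative."""
    rng = random.Random(seed)
    community = [u for u in range(M) for _ in range(n)]  # node -> community
    members = [[u * n + j for j in range(n)] for u in range(M)]
    k_intra = [0] * (M * n)
    k_inter = [0] * (M * n)
    edges = set()

    def pick(candidates, degree):
        # Preferential choice with weight (degree + 1), as in Eqs. (1) and (2).
        weights = [degree[v] + 1 for v in candidates]
        return rng.choices(candidates, weights=weights, k=1)[0]

    def link(a, b, degree):
        # Assumption: a forbidden link (self-loop or duplicate) is skipped.
        key = (min(a, b), max(a, b))
        if a == b or key in edges:
            return False
        edges.add(key)
        degree[a] += 1
        degree[b] += 1
        return True

    for _ in range(steps):
        x = rng.random()
        u = rng.randrange(M)  # uniformly chosen community
        if x < p:
            # (i) new node attached preferentially inside community u
            target = pick(members[u], k_intra)
            new = len(community)
            community.append(u)
            k_intra.append(0)
            k_inter.append(0)
            members[u].append(new)
            link(new, target, k_intra)
        elif x < p + q:
            # (ii) new link inside community u
            start = rng.choice(members[u])
            link(start, pick(members[u], k_intra), k_intra)
        else:
            # (iii) new link from community u to a node in any other community
            start = rng.choice(members[u])
            others = [v for v in range(len(community)) if community[v] != u]
            link(start, pick(others, k_inter), k_inter)
    return community, edges, k_intra, k_inter
```

Rebuilding the list of candidate nodes at every inter-community step keeps the sketch short at the cost of speed; a larger simulation would likely maintain running weight totals instead.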
3 Degree distribution



In our community network, the degree of a node consists of two parts, the
intra-community degree and the inter-community degree. The increase in a node's
connectivity can therefore be divided into two processes, the increase of the
intra-community degree and the increase of the inter-community degree. In each
process, we assume that $k^{\mathrm{intra}}_{u,i}$ and
$k^{\mathrm{inter}}_{v,i}$ change continuously, and the probabilities
$\Pi(k^{\mathrm{intra}}_{u,i})$ and $\Pi(k^{\mathrm{inter}}_{v,i})$ can be
interpreted as the rates at which $k^{\mathrm{intra}}_{u,i}$ and
$k^{\mathrm{inter}}_{v,i}$ change, respectively. Thus, the operations (i)-(iii)
all contribute to $k_i$, each being incorporated in the continuum theory as
follows.

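(i) Addition of a new node in a randomly chosen community with probability p:

$$\left(\frac{\partial k^{\mathrm{intra}}_{u,i}}{\partial t}\right)_{(i)} = p\,\frac{1}{M}\,\frac{k^{\mathrm{intra}}_{u,i} + 1}{\sum_j (k^{\mathrm{intra}}_{u,j} + 1)}. \qquad (3)$$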

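(ii) Addition of a new link in a randomly chosen community with probability q:

$$\left(\frac{\partial k^{\mathrm{intra}}_{u,i}}{\partial t}\right)_{(ii)} = q\left[\frac{1}{N} + \frac{1}{M}\,\frac{k^{\mathrm{intra}}_{u,i} + 1}{\sum_j (k^{\mathrm{intra}}_{u,j} + 1)}\right], \qquad (4)$$

where N is the total number of nodes. The first term on the right-hand side
(rhs) corresponds to the random selection of one end of the new link, while
the second term on the rhs reflects the preferential attachment (Eq. (1)) used
to select the other end of the link.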

(iii) Addition of a new link between two communities with probability r:

$$\left(\frac{\partial k^{\mathrm{inter}}_{v,i}}{\partial t}\right)_{(iii)} = r\left[\frac{1}{N} + \frac{M-1}{M}\,\frac{k^{\mathrm{inter}}_{v,i} + 1}{\sum_{m \neq u,\, j} (k^{\mathrm{inter}}_{m,j} + 1)}\right]. \qquad (5)$$

The first term on the rhs represents the random selection of one end of the new
link, while the second term on the rhs considers the preferential attachment
(Eq. (2)) used to select the other end of the link in the other community.

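Combining the contributions of the above processes, we have

$$\frac{\partial k^{\mathrm{intra}}_{u,i}}{\partial t} = \frac{p+q}{M}\,\frac{k^{\mathrm{intra}}_{u,i} + 1}{\sum_j (k^{\mathrm{intra}}_{u,j} + 1)} + \frac{q}{N}, \qquad (6)$$

$$\frac{\partial k^{\mathrm{inter}}_{v,i}}{\partial t} = \frac{r}{N} + r\,\frac{M-1}{M}\,\frac{k^{\mathrm{inter}}_{v,i} + 1}{\sum_{m \neq u,\, j} (k^{\mathrm{inter}}_{m,j} + 1)}, \qquad (7)$$

with

$$\sum_j (k^{\mathrm{intra}}_{u,j} + 1) = \frac{2(p+q)t}{M} + \frac{Mn + pt}{M} = \frac{(3p+2q)\,t}{M} + n,$$

$$\sum_{m \neq u,\, j} (k^{\mathrm{inter}}_{m,j} + 1) = 2tr\,\frac{M-1}{M} + \frac{M-1}{M}\,(Mn + pt) = \frac{(2-p-2q)(M-1)}{M}\,t + (M-1)\,n.$$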



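We can simplify Eqs. (6) and (7) for large t:

$$\frac{\partial k^{\mathrm{intra}}_{u,i}}{\partial t} \approx \frac{p+q}{3p+2q}\,\frac{k^{\mathrm{intra}}_{u,i} + 1}{t} + \frac{q}{pt}, \qquad (8)$$

$$\frac{\partial k^{\mathrm{inter}}_{v,i}}{\partial t} \approx \frac{1-p-q}{2-p-2q}\,\frac{k^{\mathrm{inter}}_{v,i} + 1}{t} + \frac{1-p-q}{pt}. \qquad (9)$$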
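The large-t step can be spelled out term by term. The sketch below assumes that the initial-size terms n and (M-1)n in the two sums become negligible compared with the terms growing in t, and that N = Mn + pt can be replaced by pt; r stands for 1 - p - q.

```latex
\begin{align*}
\frac{p+q}{M}\,
\frac{k^{\mathrm{intra}}_{u,i}+1}{\frac{(3p+2q)\,t}{M}+n}
  &\approx \frac{p+q}{3p+2q}\,\frac{k^{\mathrm{intra}}_{u,i}+1}{t},
& \frac{q}{N} &\approx \frac{q}{pt},\\[4pt]
r\,\frac{M-1}{M}\,
\frac{k^{\mathrm{inter}}_{v,i}+1}{\frac{(2-p-2q)(M-1)\,t}{M}+(M-1)\,n}
  &\approx \frac{1-p-q}{2-p-2q}\,\frac{k^{\mathrm{inter}}_{v,i}+1}{t},
& \frac{r}{N} &\approx \frac{1-p-q}{pt}.
\end{align*}
```

In both rates the community number M cancels, which is why it does not appear in Eqs. (8) and (9).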



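The boundary conditions of the intra-community degree and the inter-community
degree at the initial time $t_s$ can be estimated in the sense of mathematical
expectations, $k^{\mathrm{intra}}_{u,i}(t_s) = p + q$ and
$k^{\mathrm{inter}}_{v,i}(t_s) = r$, respectively. So we write the solutions of
Eqs. (8) and (9) as

$$k^{\mathrm{intra}}_{u,i}(t) = \frac{p^3 + p^2 + 2p^2 q + 4pq + pq^2 + 2q^2}{p(p+q)} \left(\frac{t}{t_s}\right)^{\frac{p+q}{3p+2q}} - \frac{p^2 + 4pq + 2q^2}{p(p+q)}, \qquad (10)$$

$$k^{\mathrm{inter}}_{v,i}(t) = \frac{2 + p - p^2 - pq - 2q}{p} \left(\frac{t}{t_s}\right)^{\frac{1-p-q}{2-p-2q}} - \frac{2 - 2q}{p}. \qquad (11)$$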
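As a sanity check, the closed forms can be compared numerically with the rate equations they are meant to solve. The sketch below is only an illustration: the function names are hypothetical, and the formulas are the solutions of Eqs. (8) and (9) with the stated boundary conditions k(t_s) = p + q and k(t_s) = r, written as an amplitude times a power of t/t_s minus a constant shift.

```python
def k_intra_solution(t, ts, p, q):
    """Intra-community degree at time t of a node added at time ts, Eq. (10)."""
    exponent = (p + q) / (3 * p + 2 * q)
    shift = (p * p + 4 * p * q + 2 * q * q) / (p * (p + q))
    return (p + q + shift) * (t / ts) ** exponent - shift


def k_inter_solution(t, ts, p, q):
    """Inter-community degree at time t of a node added at time ts, Eq. (11)."""
    r = 1 - p - q
    exponent = r / (2 - p - 2 * q)
    shift = (2 - 2 * q) / p
    return (r + shift) * (t / ts) ** exponent - shift


def rate_intra(k, t, p, q):
    """Right-hand side of Eq. (8)."""
    return (p + q) / (3 * p + 2 * q) * (k + 1) / t + q / (p * t)


def rate_inter(k, t, p, q):
    """Right-hand side of Eq. (9)."""
    r = 1 - p - q
    return r / (2 - p - 2 * q) * (k + 1) / t + r / (p * t)


def central_difference(f, t, h=1e-4):
    """Numerical derivative of f at t."""
    return (f(t + h) - f(t - h)) / (2 * h)
```

Evaluating `central_difference` on either solution and comparing it with the corresponding rate at the same t gives a quick check that a formula has been transcribed without sign or index slips.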



In random networks, the degree distribution can be calculated by

$$P(k) = \frac{1}{N} \sum_{i=1}^{N} \delta(k_i(t) - k), \qquad (12)$$

which gives

$$P(k^{\mathrm{intra}}) = \frac{3p^2 + 2pq}{p^3 + p^2 + 2p^2 q + 4pq + pq^2 + 2q^2} \left[\frac{p^2 + 4pq + 2q^2 + (p^2 + pq)\,k^{\mathrm{intra}}}{p^3 + p^2 + 2p^2 q + 4pq + pq^2 + 2q^2}\right]^{-\frac{4p+3q}{p+q}}, \qquad (13)$$

$$P(k^{\mathrm{inter}}) = \frac{2p - 2pq - p^2}{2 - p - 4q - 2p^2 + 2q^2 + 2p^2 q + pq^2 + p^3} \left[\frac{2 - 2q + p\,k^{\mathrm{inter}}}{2 + p - 2q - pq - p^2}\right]^{-\frac{3-2p-3q}{1-p-q}}. \qquad (14)$$

Thus, the degree distribution of our network obeys a generalized power-law
form

$$P(k) \sim [A(p,q)\,k + B(p,q)]^{-\gamma(p,q)}. \qquad (15)$$

Fig. 1. (Color online) Log-log representation of distributions of intra-community
degree (a), inter-community degree (b), and total degree (c) of nodes. All the
simulation results (squares) display right-skewed distributions. The circles in
(a) and (b) denote analytical results predicted by Eqs. (13) and (14),
respectively. The solid line in (c) is a guide to the eye with power-law decay
exponent γ = 3.0. The experimental network has a total number of nodes N = 10^
with parameters M = 10, n = 5, p = 0.4, and q = 0.4.
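The analytical curves of Eqs. (13) and (14) are straightforward to evaluate. The following sketch is a hedged illustration with hypothetical function names; it takes the exponents to be 1 plus the inverse of the growth exponents in Eqs. (10) and (11), which is what the continuum approach yields for large t when node arrival times are uniformly distributed.

```python
def gamma_intra(p, q):
    """Decay exponent of the intra-community degree distribution."""
    return (4 * p + 3 * q) / (p + q)


def gamma_inter(p, q):
    """Decay exponent of the inter-community degree distribution."""
    return (3 - 2 * p - 3 * q) / (1 - p - q)


def p_intra(k, p, q):
    """Analytical P(k) for the intra-community degree, Eq. (13)."""
    d = p**3 + p**2 + 2 * p**2 * q + 4 * p * q + p * q**2 + 2 * q**2
    base = (p**2 + 4 * p * q + 2 * q**2 + (p**2 + p * q) * k) / d
    return (3 * p**2 + 2 * p * q) / d * base ** (-gamma_intra(p, q))


def p_inter(k, p, q):
    """Analytical P(k) for the inter-community degree, Eq. (14)."""
    r = 1 - p - q
    amplitude = 2 + p - 2 * q - p * q - p**2
    prefactor = (2 * p - 2 * p * q - p**2) / (r * amplitude)
    base = (2 - 2 * q + p * k) / amplitude
    return prefactor * base ** (-gamma_inter(p, q))
```

Under this reading, the bracketed base equals 1 at the boundary values k = p + q (intra) and k = r (inter), so each density integrates to one from its boundary value upward.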



In Fig. 1 we present numerical results for the distributions of the
intra-community degree, the inter-community degree, and the total degree of
nodes on a log-log scale. The experimental network is generated by the proposed
scheme with N = 10^, M = 10, n = 5, p = 0.4, and q = 0.4. The distributions
of the intra-community degree and the inter-community degree, shown in Figs.
1(a) and 1(b), agree with the analytical results of Eqs. (13) and (14),
respectively. The small deviations between computer simulations and analytical
solutions at both ends of the distributions appear to stem from the
mathematical approximation of the boundary conditions and from the finite-size
effect due to the relatively small network sizes used in the simulations.
According to the evolving rule of our network, nodes with larger intra- (or
inter-) degree have a higher probability of gaining new links, so the usual
degree-based preferential attachment is largely preserved. This means that the
right-skewed character of the network, for example in the nodes' total degree,
is retained. As shown in Fig. 1(c), the total degree distribution of nodes is,
as expected, right-skewed, which is in reasonable agreement with many realistic
systems [18].

Fig. 2. (Color online) The degree distribution of econophysicists (squares) of an
econophysics scientific collaboration network [19]. The circles correspond to
computer simulations of our model with parameters M = 10, n = 2, p = 0.4, and
q = 0.4.
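The simulated points in Figs. 1 and 2 are empirical degree distributions. A minimal sketch of how such a distribution might be tabulated from a list of node degrees is given below; the function name is hypothetical, and a plain normalized histogram is assumed because the text does not say whether any binning was applied.

```python
from collections import Counter


def degree_distribution(degrees):
    """Return {k: fraction of nodes with degree k}, ordered by increasing k."""
    counts = Counter(degrees)
    total = len(degrees)
    return {k: counts[k] / total for k in sorted(counts)}
```

Plotting the resulting pairs on log-log axes gives the kind of representation used in the figures; nodes of degree zero have to be dropped or shifted before taking logarithms.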



To illustrate the predictive power of the model, we also compare the numerical
results of our network with the statistics of an econophysics collaboration
network. In the econophysics collaboration network, each node represents one
scientist. If two scientists have collaborated on one or more papers, they are
connected by an edge. Zhang et al. took the largest connected component of this
network, which includes 271 nodes and 371 edges, and provided the best
division, i.e., M = 10 [19]. In Fig. 2 we plot the degree distribution of
econophysicists in the econophysics collaboration network, which is fitted by
computer simulations of our network starting with 10 communities. To obtain p
and q, we fit the connectivity distribution P(k) obtained from this
collaboration network with Eq. (15), obtaining a good overlap for p = 0.75 and
q = 0.15 (Fig. 2).
4 Conclusion



Networks with community structures underlie many natural and artificial
systems. It is becoming essential to model and study this kind of topological
feature. We presented a simplified mechanism for networks organized in
communities, which corresponds to local events during the system's growth. The
generated network is highly clustered and has a right-skewed distribution of
connectivity, which has been found to be very common in realistic systems. The
present paper only suggests a simple way of generating community networks.
The shape of the resulting network is deterministic to some extent. It would be
more interesting to model the evolution of communities, especially the
self-organization (or emergence) of communities in the natural world [20],
e.g., their expansion and shrinkage, which is left to future work.